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NONLINEAR ZERO-SUM DIFFERENTIAL GAME ANALYSIS BY SINGULAR 

PERTURBATION METHODS 
J. Shinar* and N. Farber+ 

Ames Research Center 


SUMMARY 


A certain class of nonlinear, zero-sum differential games, exhibiting time-scale 
separation properties, is suitable for analysis by singular-perturbation techniques. 
The merits of such an analysis, leading to an approximate game solution, as well as 
the "well-posedness" of the formulation, are discussed. The singular-perturbation 
approach is shown to be particularly attractive for investigating pursuit-evasion 
problems; the original multidimensional differential game is decomposed to a simple 
pursuit" (f ree-stream) game and two independent (boundary- layer) optimal— control 
problems. Using multiple time-scale boundary-layer models in such games results in a 
pair of uniformly valid zero-order composite feedback strategies. Though a priori 
perfect information is assumed, the suboptimal strategies depend only on relative 
geometry and own-state measurements. This is demonstrated by a three-dimensional, 
constant-speed example. For game analysis with realistic vehicle dynamics, the tech- 
nique of forced singular perturbations and a variable modeling approach is proposed. 
Accuracy of the zero-order singular-perturbation analysis is evaluated by comparison 
with the exact (numerical) solution of a time-optimal, variable-speed "game of two 
cars" in the horizontal plane. 


I. INTRODUCTION 


The notion of differential games, coined by Isaacs (ref. 1), was intended ini- 
tially to describe a mathematical framework for investigating pursuit-evasion problems. 
Today, the general theory of differential games has a much larger horizon, but 
pursuit-evasion games still attract a particular interest. However, the progress in 
solving nonlinear games, representing realistic problems, has been slow and frustrat- 
ing. Although it has been proved (refs. 2 and 3) that pursuit-evasion games with 
separable convex dynamics and properly defined terminal manifolds have a saddle-point 
solution, only games with very simple structures have been solved (refs. 3-6). 

The basic reason for this unsatisfactory progress is the complex nature of the 
game solution. First, a nonlinear, two-point, boundary-valve problem of high dimen- 
sion, describing the set of necessary conditions of optimality, has to be solved. 

Such a solution, generating candidate extremals, is valid only if those trajectories 
do not reach singular surfaces of the game. And the existence of such surfaces of 
discontinuity (either in the cost function or in its gradient) is a well-known 
phenomenon (refs. 7 and 8) in differential game theory. For this reason the numerical 
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techniques (refs. 9~12) , which can provide a solution of the two— point boundary-value 
problem, are of limited value. They also require an excessive amount of computation, 
prohibitive for real-time implementations, and provide only little insight for a sys- 
tematical analysis. For such purposes a reasonable analytical approximation may be 
of great value. 

In analysis of differential games, where fast and slow variables can be dis- 
tinguished, the method of singular perturbation — successfully used in optimal control 
^rme— scale systems (refs. 13—17) — can be applied. By this technique the 
differential game can be decoupled to a set of lower— dimensional, analyti- 
cally solvable subgames, enabling the synthesis of an approximation to the exact 
solution. 

There are, however, some conceptual difficulties to be overcome. The first 
attempts to investigate singularly perturbed linear differential games (refs. 18-21) 
revealed that because of the differences in the information structure between an 
open- loop and closed-loop solution, some singularly perturbed differential games, may 
not be well— posed (i.e., the solution of the "slow" game does not approximate the 
solution of the original one, even if the perturbation parameter, characterizing the 
time— scale separation, tends toward zero). "Well-posedness" was, however, demon- 
strated (ref. 18) for zero-sum linear games and it was conjectured (ref. 20) for cases 
in which the "fast" game has identical closed-loop and open- loop solutions. 

Other studies (refs. 22 and 23) clarified the significance of an approximate game 
solution. By defining the outcome of a singularly perturbed, two-person, zero-sum 
differential game in which both players follow approximate (synthesized.) strategies — 
as the "extended value," it was demonstrated that 

1. The difference between the extended value and the exact saddle-point value 
is bounded on both sides 

2. The extended value satisfies by itself a weak saddle inequality 

3. The extended value tends, as a limit, toward the exact value of the singu- 
larly perturbed game , as the singular perturbation parameter e approaches zero 


4. The accuracy of the extended value for a zero-order approximation is pre- 
dicted to be of the order of e^. 

A class of pursuit-evasion games, analyzed by the technique of singular perturba- 
tions, can be characterized by the following properties: 

1 . There is a time-scale separation between the variables describing the slow 
relative geometry and the fast variations of vehicle dynamics 

2. The dynamics of the relative geometry are separately controlled by the 
players 

3. The dynamics of the fast variables are independent of the slow ones 

4. The fast vehicle dynamics are independently controlled by the respective 
players 

5. The terminal surface is defined by the slow variables only. 
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For this class, the original high-dimensional game can be decomposed to a slow 
"simple pursuit game" and to two sets of fast, independent, optimal-control problems 
satisfying the conjecture of reference 20. 

The objective of this paper is to summarize the results of singular-perturbation 
analysis for a set of pursuit-evasion games of gradually increasing complexity. In 
all of these games, capture is defined as the pursuer approaching the evader within a 
predetermined distance and is assumed to be guaranteed. The cost to be optimized is 
the "time of capture." 

The scope of the paper is, however, limited to two recently completed investiga- 
tions. In section II the three-dimensional, constant-speed pursuit-evasion game is 
analyzed, and it is shown how the singular-perturbation solution (ref. 24) enlightens 
the apparent controversy raised in previous works (refs. 25-27). The results o a 
comparison (ref. 28) of the zero-order feedback approximation, obtained by singular- 
perturbation analysis (ref. 29), and the open-loop optimal solution of variable-speed 
horizontal pursuit-evasion game (ref. 30) are presented in section III. This compari 
son has validated the singular-perturbation approach for the analysis of complex 
pursuit-evasion games for sufficiently large initial distances. In the concluding 
section, the merits of such game analysis are summarized. 


II. THREE-DIMENSIONAL "GAME OF TWO CARS" 


Summary of Two-Dimensional Results 

As a first example of a singularly perturbed pursuit-evasion game, the two- 
dimensional, constant-speed game with constrained turning rates (Game of ^o Cars) was 
analyzed in reference 22, since its optimal solution (ref. 31) was available for com 
parison. This optimal solution, obtained for a sufficiently large initial distance, 
requires that each player uses his maximum turning rate until the final line of sight 
is reached. Then the game continues as a straight line pursuit. 

The singular-perturbation parameters in this problem are the ratio of the turning 
radii of the players to the initial distance of separation. The approximate strategy 
pair for this game, obtained by using the singular-perturbation technique, is given 
in a feedback form. It directs the players to align their velocity vectors with the 
current line of sight. If both players can do it simultaneously, the approximate 
strategies yield a result which is identical to the exact solution. Othewise, the 
extended value will shift in favor of the player who reaches the current line of 
sight later than his opponent. Numerical comparison showgd that the payoff accuracy 
of the approximation is better than the predicted order of e . This successful 
demonstration of merits of the singular-perturbation approach encourapd investigation 
of more complex pursuit-evasion game models, such as (1) three-dimensional, constant 
speed; (2) two-dimensional, variable speed; and (3) three-dimensional, variable speed. 

In the following subsection the first game model is formulated, using vector 
notations. 


Problem Formulation 

The equations of motion of the constant-speed pursuit-evasion in a three- 
dimensional space are 
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( 1 ) 


dR 

^ =. V - V : 
dt . E * 


^(to) 


-V 

= 


dV 

IT = “p " • ^p('o) 


= V. 


dVp 

^ = “e ^ 


V. 


( 2 ) 

(3) 


The angular velocity vectors {jL)p,o)£ are perpendicular to the respective vectors of 
linear velocity, that is. 



(4) 


and they are constrained in magnitude, 



(5) 



( 6 ) 


The game terminates at t = tf, when the two following conditions are satisfied for 
the first time: 


|^(tf)i = d 


(7) 


|R(tf)| < 0 (8) 

It was shown (ref. 32) that if |Vp| > jVgl and fip > fig then a finite capture 
time can be always guaranteed. In the sequel, it is assumed that these inequalities 
are respected. It permits the definition of a game of degree (ref. 1) in which the 
cost to be optimized is the time of capture tf, defined as 

/ = tj = min arg |R(t) | = d (9) 

t>to 1^0 I 


The controls of the players are cop and cop, respectively, and their optimal values 
have to satisfy the necessary saddle-point conditions of 

min max,}^- max min,^= 0 (10) 

tOp 0)£ 03g 0)p 

where Jif is a scalar function (the Hamiltonian of the game) given by 

<7^ = 1 + ^R* (Vg - Vp) + . (Wp X Vp) + • (mg X Vg) (11) 

P E 

The adjoint vectors components of the gradient of the cost func- 

tion (assuming that such a gradient exists) and they have to satisfy 
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dt 
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dt 


'' 
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= const 


3V R \ l:- Vp/ 


9^ 




(“E '' V) 


(12) 


(13) 


(14) 


The transversality conditions, applied at the terminal surface defined by equa- 
tion (7), lead to 


tgUf) II S(t£) 


X (tj) - X, (tj) = 0 
P E 


(15) 

(16) 


It can be thus concluded from equations (12) and (15) that X^Ct) is a constant vector 
parallel to the final line of sight and that its magnitude can be determined from 
equation (10) applied at the terminal manifold, 


\ - iVp(tf) - Vp(tf)] = 1 


(17) 


The optimal-control strategies are obtained from equation (10), using also equa 
tions (5) and (6): 




\ ''p 


(18) 


“ ~^E It 


Ve " ''e 


(19) 


X 


I 

X V 


These expressions reflect the required perpendicularity expressed in equation (4). 

The set of necessary conditions of game optimality and the equations of motion 
represent a nonlinear, two-point, boundary-layer problem with 6 three-dimensional 
vector variables. In the next subsection the optimal solution is approximated, using 
the singular-perturbation technique. 


Zero-Order Feedback Approximation 


The first step in applying singular-perturbation analysis is the transformation 
of the original problem to exhibit a singularly perturbed structure. The results of 
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the two-dimensional analysis (ref. 22) suggest that if the turning radius of the 
players, defined by 

r = |v|/|u| (20) 

IS small relative to the initial distance of separation |^o|, there is a time-scale 
separation between trajectory and vehicle dynamics which can be demonstrated by a 
scaling transformation. Such a transformation can be avoided by using the technique 
of "forced" singular perturbations, proposed first by Kelley (ref. 33) and used since 
in many optimal-control problems (refs. 34-37). This method is based on the insertion 
of a unit-value, "artificial" perturbation parameter as a multiplier of the respective 
time derivatives of the fast variables. Judicious ordering of the state variables, 
representing the actual hierarchy of the time-scale separations, is an essential pre- 
requisite for a meaningful approximation. Recently it was formally demonstrated 
(ref. 38) that the "forced" singular-perturbation approach and the properly scaled 
transformation of the variables (using singular-perturbation parameters of physical 
significance) yield identical zero-order feedback solutions. The example of the 
constant-speed two-dimensional game solved by this method in reference 23 illustrated 
this equivalence. By using this technique, equations (l)-(3) can be transformed to 


II 


R(tp 

= K 

(21) 

dV 





dt 

(“p ^ Vp) ; 

Vp(to) 

II 

o 

(22) 

dt 



c 

II 

(23) 


By setting e = 0 the reduced-order game is obtained (variables of this game are 
denoted by the subscript o) yielding 


dR° 


(V; - Vp ; R°(t^) = R^ 


dt ' E P' 


(24) 


and the constraints 


(oj; X Vp = ( 0 )° X vp = 0 
Combining equations (25) and (4) leads to 


(25) 


= U3° = 0 


(26) 


In this reduced game, the controls are Vp and Vg, and they have to satisfy 

min max ^ = max min ,y€° = 0 


->■0 




V V 


where 


= 1 + t° • (V° - Vp 


(27) 


(28) 
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The adjoint vector satisfies an equation similar to equation (12), and conse- 

quently it is a constant vector, parallel to the final line of sight of the reduced 
game R®(tf). 


Optimization of the reduced-order Hamiltonian leads to 



(29) 



(30) 


where e^ is a unit vector in the R® direction. We also obtain from equation (27) 
that 


t: 



(31) 


The solution of the reduced game is therefore a straight-line pursuit. Substi- 
tuting equations (29) and (30) into equation (21) indicates that the direction of the 
line of sight does not change in this subgame, that is, S®(t£) is parallel to 
and consequently 


e®(tf) = (32) 

Since equations (29) and (30) are generally incompatible with the prescribed 
initial conditions and a boundary-layer game has to be introduced, using a 

time-stretching transformation 



(33) 


By substituting equation (33) into equations (21)-(23), the following set of equations 


(variables 

are denoted by 

the 

superscript 

i) 

is obtained: 




dR^ 

dT 

= e(^i - Vj) 

> 

o 

II 

(34) 



dx 


9 

Vp(0) = V 
^ o 

(35) 



dx 

= X V^) 

9 

v^(0) = 

(36) 

Setting e 

= 0 leads to 









S^(t) = const 

= K 

(37) 
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The Hamiltonian of this boundary- layer game is similar to equation (11). It is 
obtained by expressing the adjoints of the singularly perturbed game equa- 
tions (21)- (23) (denoted by the superscript e) as 




te _ t 

Ay eXy 

P P 


(38) 


Xy eX 
E '^E 


= 1 + tj. (^i - ^J) + tj . (;:;J X ^j) 

P E 


(39) 


The adjoint equations of the boundary layer are 



(40) 


(41) 


(42) 


The matching condition requires 





(43) 


where X^ is already determined by equations (31) and (32). The other adjoint vec- 
tors have to satisfy 


lim 

tj (T) 

- X (t^) 

= 0 

(44) 

X-x» 

P 

P 



lim 

tj (X) 

X (tj) 

= 0 

(45) 

J-XX) 

E 

E 




The optimality conditions, similar to equation (10), lead to 


(cOp)* = flp 





(46) 
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( 47 ) 


.1 ^ 

£ 


It can be already seen that equations (35), (41), (44), and (46) are independent of 
equations (36), (42), (45), and (47), and consequently each of the equation sets can 
be solved separately. 

Substituting equations (43) and (46) into equation (41) yields 




dx'' ■ S 




p / p 


I X I 
\\ Vpi 


(48) 


Using the triple vector product identity 


(a X 5) X c = a(&- c) - ?(a • c) 


(49) 


we can rewrite equation (48) in a simplified form. 




(50) 


where kj^ and k 2 are scalar multipliers. Integrating equation (50) backwards from 
Xyp(tf) = 0, we observe that initially the derivative vector lies in the plane 

defined by and Vp. Consequently, the vector A^p itself will grow (in the 

retrograde sense) in this plane only and can be expressed by 


tj (T) = k3(T)X° + k4(T)Vp 


R 


(51) 


Substituting equation (51) into equation (A6) reveals that 


^i 


X vj 


(52) 


Similarly we obtain for the evader 


o ;^i 


X X V 
(-i)* = o ^ 

R '^E 


(53) 


Since by equations (31) and (32) is parallel to R° , we have the control strate- 
gies in the form 
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(54) 


(wp)* 


= 


R XV 
o 

1^0 >= ^ 


(“E>* 


= Q. 


li - ^1 


(55) 


These boundary- layer strategies imply that each vehicle should turn in the plane 
defined by the initial line-of-sight vector and its own velocity. A uniformly valid 
composite zero-order feedback solution (denoted by the superscript c) is obtained if 
the initial condition is replaced in equations (54) and (55) by the current 

value of the line-of-sight vector R (and the superscripts i omitted): 


->c 

(jJ 

P 


= 


R X V, 


It X s 


->c 

“e 


R X 


= Q 


E 


R X V 


(56) 


(57) 


The accuracy of this approximation depends on the time-scale separation of the 
variables, that is, on the smallness of the perturbation parameters 


Ej = r^/lR^I j = P , E 


(58) 


where r is defined in equation (20) . 


The Optimal Solution 

To our knowledge, the complete optimal solution of the game has not been pub- 
lished. Moreover, the results that are available have created some controversy. In 
several investigations (e.g. , refs. 26 and 27) it was indicated that the three- 
dimensional game tends to take place in an inclined plane. In a recent paper 
(ref. 28) it is shown, however, that the optimal strategies of this game, starting 
with noncoplanar initial conditions, are essentially three dimensional. The singular- 
perturbation analysis can resolve this apparent contradiction. As a matter of fact it 
extends the original two-dimensional solution of Simakova (ref. 31) to a three- 
dimensional space. Applying the same approach that was taken in the boundary-layer- 
game equations (48)-(54) to the exact-game equations (1)-(19) shows that the optimal 
maneuvers of the players take place in two fixed planes, determined by the final 
(a priori unknown) direction of the line of sight and the respective velocity vectors. 
This open-loop solution, expressed in a nonrotating coordinate system, embeds the 
results of references 25-27 , showing that they are complementary and do not contradict 
each other. The optimal strategies obtained in reference 27 are expressed in a coor- 
dinate system aligned with the line of sight and, therefore, the existence of the 
fixed maneuver planes is not transparent. It also agrees with the observation that if 
the game starts in (or reaches) a plane, the optimal trajectories are two dimensional. 
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It is also obvious that if the ratios of the turning radii of the respective 
players to the initial distance of separation (i.e., the singular-perturbation param- 
eters of the game) are small, the rotation of the line-of-sight vector is negligible, 
and the current direction of this vector, used in the zero-order feedback strategies, 
is a good approximation to its final orientation. 


III. VARIABLE-SPEED HORIZONTAL PURSUIT-EVASION 


Singular -Perturbation Analysis 

In variable-speed pursuit-evasion games there exists also a time-scale separation 
between the velocity and turning dynamics of the players. For such cases the method 
of forced singular perturbations is particularly rewarding since it has the potential 
to generate a multiple time-scale differential game. Such a singularly perturbed 
game, consisting of several consecutively faster subgames, is solvable in a closed 
form. An example of the variable-speed pursuit -evasion game in the horizontal plane 
was analyzed first in reference 29. Only recently has it become possible to validate 
this analysis (ref. 28) by comparing its solution with the open- loop optimal solution 
of the same game (ref. 30). 

Because of the limited scope of the present paper, only the main results of the 
singular-perturbation analysis and the more important aspects of the comparison are 
summarized in this section. For further details the reader is referred to the 
original report (ref. 28). 

For the forced singular-perturbation analysis, the variable-speed pursuit-evasion 
game between two airplanes in the horizontal plane is described by the following 
multiple time-scale mathematical model (variables are defined in fig. 1): 


R = Vg cos(Xg - - Vp cos(Xp - 

9 

R(t ) = R 

(59) 

ij) = [Vp sin(Xp - i|)) - Vp sin(Xp - 4')1/R 

9 

'Kto) = 

(60) 

eVp = ” "p(Dj^)p]/Wp 

9 

= ^P 

o 

(61) 


9 

o 

(62) 

e^Xp = g(n^ - l)'/VVp 

9 

Xp(to> = Xp 

o 

(63) 

c\ = g(n^ - l)^/VVp 

9 

“ ^E 

(64) 


In this model, the maximum thrust of the airplanes as well as their 

zero-lift drag "Dq" and the straight-flight-induced drag is a known function of 

the velocity. Weight W is assumed to be constant. The control variables of the 
players are the throttle setting 


0 < 5 < 1 


(65) 
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( 66 ) 


and the aerodynamic load factor (or lift to weight ratio) 

n = (L/W) 

This last control variable is subject to two different constraints; a structural limit 


n < n 


max 


(67) 


and an aerodynamic (speed-dependent) one: 


n < n^(V) 


( 68 ) 


Singular perturbation analysis (refs. 28 and 29) leads to an approximation of the 
optimal-control strategies in an explicit feedback form. These strategies are similar 
for both players. They require full thrust 

= 1 (69) 


and a load factor given by the equation 


(n"")" = 1 + 


- V 


T - (D + D.) 
max o 

D. 

1 


[1 - cos(i|) - x)] 


(70) 


but subject to the constraints of equations (67) and (68). In this expression, V 
is the velocity in the reduced-order game, which is estimated to be the maximum 
velocity of the aircraft. 


As can be seen, equation (70) expresses the required load factor as an explicit 
function of the state variables (i|j,x>V) and the performance parameters (thrust, drag) 
of the airplanes. Note that no range measurements or estimation of the opponent's 
parameters are required. Based on equation (70), a simple feedback chart, as shown 
in figure 2, can be prepared for each airplane. 


The very form of equation (70) provides an important insight into the nature of 
the suboptimal maneuver, which can be summarized as follows: 


1. The required load factor can be related to the highest constant speed (sus- 
tained) turning performance since 


X(v = 0) = I 


T 

max 


(D + D . ) 
o 1 

D. 

1 


(71) 


indicating the compromise between a fast turn and longitudinal acceleration, both 
being necessary for a successful pursuit-evasion. 

2. The function [1 - cos (i]7 - is almost linear for - x\ < 60°. 

Therefore, the required turning rate can be considered, for small deviations from the 
line of sight, as a proportional control with a speed-dependent gain. 

3. A part of this gain, given by V/(V^ - V), provides an additional insight. 

It indicates that if the speed is near its predicted value in the reduced game (i.e., 
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when there is a little need for acceleration), a faster turn can be made. However, 
if (v^ - V) is large, it is better not to lose velocity by making a sharp turn, but 
to use the excess thrust for acceleration. 

4. The control strategies (eqs. (69)- (70)) predict that the trajectory of both 
airplanes will asymptotically approach the line-of-sight vector and that the engage- 
ment will end by an accelerating "tail chase." 


Comparison with the Optimal Solution 

The zero-order feedback approximation obtained by singular-perturbation analysis 
was compared recently (ref. 28) with the open- loop optimal solution of the same game 
(ref. 30). This solution is based on using the direction of the terminal line of 
sight as a reference. In this particular coordinate system, the necessary conditions 
of game optimality are decoupled into two one-sided optimization problems. Thus, 
extremal trajectories can be generated by backward integration from the terminal sur- 
face independently. A recent implementation of this approach (ref. 39) makes it pos- 
sible to compare trajectories and time-histories of variables to suboptimal candi- 
dates, such as the zero-order feedback approximation. Initial conditions of an 
example are depicted in figure 3. Those initial conditions were obtained by simul- 
taneous backward integration of two different trajectories (one for the pursuer and 
one for the evader) from different end conditions satisfying Vp(tf) > VE(tf) , using 
i(j 0 j^rical aircraft model from the open literature (ref. 40) . Note that the initial 
separation in the x-direction can be adjusted by selecting different capture ranges 
(Rf = d) . 

The time-histories of the control variables seem to be similar, even for a rela- 
tively short initial range (only 4 times larger than the pursuer's turning radius), 
as can be seen in figures 4 and 5. Inspection of turning time-histories in figures 6 
and 7 reveals more differences between the optimal solution and the feedback approxi- 
mation. The evader, using the suboptimal strategy, reaches the actual line of sight 
very nearly at the same time as the open— loop optimal trajectory closely approaches 
the final (reference) direction. His turning strategy can, therefore, be considered 
as close to optimal. One cannot say the same about the pursuer. Since the initial 
conditions of the pursuer are more favorable than those of the evader, the pursuer 
can reach the line of sight in a shorter time. Because the evader has not yet com- 
pleted his turn, the line of sight continues to rotate. Consequently, the pursuer, 
using a suboptimal strategy, has made an unnecessary turn (an overshoot) which has to 
be corrected later. As a result, it can be expected that the capture time of the 
suboptimal game will be longer than the optimal one. Indeed, for this example, the 
time of capture predicted by the feedback approximation is about 6.5/ higher than the 
optimal value of 97.1 sec. 

The major reason for this level of inaccuracy is the relatively large value of 
the singular-perturbation parameter (cp = rp/Rg = 0.25) in the example. By changing 
the capture range R£, and accordingly the initial distance of separation Rq, a large 
set of pursuit— evasion games is generated and the suboptimal outcomes of the games 
are compared with the optimal time of capture. The results of this comparison, pre- 
sented in figure 8, demonstrate a very satisfactory accuracy for small values 
(ep < 0.125) of the perturbation parameter. Since in this example the turning 
requirement of the pursuer is less than that of the evader, the suboptimal strategy 
of the pursuer deviates more from the optimal than that of the evader. Consequently 
the actual time of capture obtained when both players are using zero— order feedback 
strategies is longer than the optimal value. 
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In summary, the zero-order, singular-perturbation strategies preserve all of the 
essential elements of the optimal solution and provide a good payoff accuracy for 
sufficiently large initial distances. 


CONCLUSIONS 


It has been shown that singular-perturbation methods are useful mathematical 
tools for analyzing a class of interesting nonlinear pursuit-evasion games. The 
explicit analytical form of the zero-order control strategies indicates the simplicity 
of an eventual implementation. This point is emphasized by the fact that in spite of 
the assumption of perfect information made in the game formulation, the suboptimal 
strategies depend only on the relative geometry and on own-state measurements. 

The analytical approximation generates an insight , which hardly can be obtained 
from an open-loop optimal solution. This point has been clearly illustrated in both 
examples discussed here. The payoff accuracy of the zero-order, singular-perturbation 
approximation was demonstrated for a variable-speed pursuit-evasion game in the hori- 
zontal plane. 

Variable-speed pursuit-evasions in the vertical plane and in three-dimensional 
space were also analyzed recently by using a variable-modeling forced singular- 
perturbation technique. This last investigation is reported in a contemporary 
paper (ref. 41). 
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Figure 1. Geometry of horizontal pursuit evasion. 
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Figure 2. Feedback chart for suboptimal turning 



Figure 3. Initial conditions for example. 
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Figure A. Evader's load factor time-hxstory . 
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Figure 5. Pursuer's load factor time-history. 
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Figure 6. Evader's turning time-history. 
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7. Pursuer’s turning time— history . 
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Figure 8. Payoff accuracy of the zero-order feedback solution. 


25 


2. Government Accession No, 


3. Recipient's Catalog No. 


1. Report No. 


NASA TM 84271 


4. Title and Subtitle 

NONLINEAR ZERO-SUM DIFFERENTIAL GAME ANALYSIS 
BY SINGULAR PERTURBATION METHODS 

5. Report Date 

July 1982 

6. Performing Organization (^de 

7. Author(s) " 

J* Shinar and N. Farber 

8. Performing Organization Report No. 

A-8998, 


10. Work Unit No. 

9. Performing Organization Name and Address 

T-33094 

NASA Ames Research Center 
Moffett Field, California 94035 

11. Contract or Grant No. 


13. Type of Report and Period Covered 

12. Sponsoring Agency Name and Address 

National Aeronautics and Space Administration 
Washington, D.C. 20546 - 

Technical Memorandum 

14. Sponsoring Agency Code 

505-34-11-11-00 

15. Supplementary Notes 


Point of contact: Dr. Heinz Erzberger, M/S 210-9, 

Ames Research Center, 

Moffett Field, CA 94035, (415) 965-5450, FTS 448-5450 


16. Abstract 


A certain class of nonlinear, zero-sum differential games, exhibiting 
time-scale separation properties, is suitable for analysis by singular- 
perturbation techniques. The merits of such an analysis, leading to an 

game solution, as well as the *'well— posedness** of the formula- 
tion, are discussed. The singular perturbation approach is shown to be 
Particularly attractive for investigating pursuit— evasion problems j the 
^^riginal multidimensional differential game is decomposed to a '^simple 
pursuit" (free-stream) game and two independent (boundary-layer) optimal- 
control problems. Using multiple time-scale boundary-layer models in such 
games results in a pair of uniformly valid zero-order composite feedback 
strategies. Though a priori perfect information is assumed, the suboptimal 
strategies depend only on relative geometry and own-state measurements. 

This is demonstrated by a three-dimensional, constant-speed example. For 
game analysis with realistic vehicle dynamics, the technique of forced 
singular perturbations and a variable modeling approach is proposed. 
Accuracy of the zero-order singular-perturbation analysis is evaluated by 
comparison with the exact (numerical) solution of a time-optimal, variable- 
speed "game of two cars" in the horizontal plane. 


17. Key Words (Suggested by Author(s)) 18. Distribution Statement 

Nonlinear game analysis Unlimited 

Pursuit-evasion games 

Singular perturbations Subject category 05 

Feedback strategies 

19. Security Qassif. (of this report) 20. Security Classif. (of this page^ 21. No. of Pages 22. Price* 

Unclassified Unclassified 29 


•For sale by the National Technical Information Service. Springfield, Virginia 22161 























